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Abstract 

Background: The aim of the present study was to identify a specific neuronal correlate underlying the pre- 
attentive auditory stream segregation of subsequent sound patterns alternating in spectral or temporal cues. Fifteen 
participants with normal hearing were presented with series' of two consecutive ABA auditory tone-triplet 
sequences, the initial triplets being the Adaptation sequence and the subsequent triplets being the Test sequence. 
In the first experiment, the frequency separation (delta-f) between A and B tones in the sequences was varied by 2, 
4 and 10 semitones. In the second experiment, a constant delta-f of 6 semitones was maintained but the Inter- 
Stimulus Intervals (ISIs) between A and B tones were varied. Auditory evoked magnetic fields (AEFs) were recorded 
using magnetoencephalography (MEG). Participants watched a muted video of their choice and ignored the 
auditory stimuli. In a subsequent behavioral study both MEG experiments were replicated to provide information 
about the participants' perceptual state. 

Results: MEG measurements showed a significant increase in the amplitude of the B-tone related P1 component of 
the AEFs as delta-f increased. This effect was seen predominantly in the left hemisphere. A significant increase in 
the amplitude of the N1 component was only obtained for a Test sequence delta-f of 10 semitones with a prior 
Adaptation sequence of 2 semitones. This effect was more pronounced in the right hemisphere. The additional 
behavioral data indicated an increased probability of two-stream perception for delta-f =4 and delta-f =10 
semitones with a preceding Adaptation sequence of 2 semitones. However, neither the neural activity nor the 
perception of the successive streaming sequences were modulated when the ISIs were alternated. 

Conclusions: Our MEG experiment demonstrated differences in the behavior of PI and N1 components during the 
automatic segregation of sounds when induced by an initial Adaptation sequence. The P1 component appeared 
enhanced in all Test-conditions and thus demonstrates the preceding context effect, whereas N1 was specifically 
modulated only by large delta-f Test sequences induced by a preceding small delta-f Adaptation sequence. These 
results suggest that PI and N1 components represent at least partially-different systems that underlie the neural 
representation of auditory streaming. 
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Background 

The brain's ability to constantly organize auditory 
objects or "auditory streams" from the competing 
sounds in the environment is a key element in human 
auditory perception. This phenomenon has been labeled 
"stream segregation" or "streaming" by Bergman and 
Campbell [1,2]. The most commonly-used stimulus con- 
figuration for investigating 'streaming^ as described by 
Van Noorden [3], consists of a low-frequency tone A 
and high-frequency tone B presented in a sequence of 
repeated ABA triplets. When the frequency separation 
(Af) between the tones is small and the inter stimulus 
interval (ISI) is long, the sequence is typically heard as 
one sound stream, like a gallop. In the case of a large Af 
and brief ISI the galloping rhythm is no longer heard 
and, instead, the perception switches to two distinct 
sound streams of A and B tones [3,4]. At intermediate 
Af and ISI values, the perception could bias in favor of 
one stream or two streams, depending on participants' 
attention [3,5] and the duration of listening [6]. 

According to the "peripheral channeling hypothesis", 
streaming occurs when different sounds excite adjacent 
peripheral channels with minimal overlapping in the 
cochlea and the auditory nerve [7]. However, the exist- 
ence of additional acoustic cues that influence percep- 
tual organization [6,8-11] as a function of time [2,6], 
suggest the contribution of centrally computed features 
in stream segregation [12-28]. An interaction between 
sub-cortical and cortical auditory structures in switching 
perception from non-streaming to streaming has been 
demonstrated recently [29]. 

Most electrophysiological studies interpret the neural 
correlates of streaming in terms of tonotopic organization 
of the auditory system [5,7,12,14,15,17-19,30-32], and 
Elhilali and colleagues demonstrated the critical role 
played by temporal coherence in the formation of audi- 
tory streams [33]. They showed that frequency-distant 
tones are no longer heard as distinct sound streams if 
presented synchronously rather than successively, al- 
though the enhanced neuronal evoked activity ascribed 
both sequences (synchronous and asynchronous) to 
streaming [33]. Thus, tonotopic organization itself is 
not enough to explain this type of perception; stream- 
ing requires both frequency separation and temporal 
incoherence [33]. In real acoustic environments, such 
as music and speech, however, sounds appear succes- 
sively, and the perception and corresponding evoked 
activity of streaming are highly influenced by recent 
auditory experience [31,34,35]. Furthermore, the encod- 
ing of subsequent sounds can be influenced by 
attention [36,37] and distinct neuronal mechanisms 
are responsible for the automatic segregation of 
sounds and attention-dependent build-up processes 
[8,22,28,38]. 



The present study aimed to investigate the neural 
bases of auditory streaming at a pre-attentive level by 
making alterations to the spectral or temporal features 
of succeeding sound sequences. Our experiments were 
motivated by the hypotheses that selective adaptation 
underlies stream segregation [15,16,23] and that modula- 
tions in auditory evoked activity can indicate this 
process [16,21,22,35,39]. Since streaming is based on 
adaptation to specific tone frequencies [14-16] and is 
strongly determined by the ISI [40], it seems reasonable 
to expect the encoding of subsequent streaming patterns 
to rely on similar physiological mechanisms. Further- 
more, multiple levels of representation in auditory 
stream segregation have been proposed by psychophys- 
ical comparisons of the effects of different types of initial 
adaptation stimuli [23,34]. Neuronal activity in stream- 
ing is also known to be modulated by prior perception 
[28,35]. In an event- related potential study by Sussman 
& Steinschneider, for instance, they showed the absence 
of mismatch negativity to a 'deviant tone' (a tone which 
deviated in frequency) from their repeating ABBB Test 
sequence induced by a priming sequence [28]. Neuro- 
logical responses to the deviant tones are thought to 
arise during streaming and thus the authors concluded 
that no streaming occurred when the Test sequence was 
preceded by a smaller Af Adaptation sequence. No be- 
havioral responses were collected, however, so compari- 
son of behavioural reponses with evoked activity was not 
possible [28]. Another EEG research by Snyder and col- 
leagues found that the adaptation to a prior stimulus 
pattern could underline the effects of functionally dis- 
tinct neural processes in stream segregation [35]. They 
also demonstrated that a large Af in the initial Adapta- 
tion sequence results in reduced streaming on the subse- 
quent Test sequence and vice versa [35]. 

The current study tested the hypotheses that 
frequency-selective adaptation underlies stream segrega- 
tion and that prior adaptation stimulation could recover 
specific neural correlates and streaming effects. We 
measured the auditory evoked fields (AEF) in response 
to an ABA triplet paradigm consisting of successive 
prime Adaptation and subsequent Test sequences. The 
contextual effect between consecutive tone patterns is 
known to decrease as the period of silence between 
them increases [2,31,35,40,41], therefore the neuro- 
physiological demonstration of a clear adaptation effect 
requires continual stimulation. For this reason we chose 
to present our experimental conditions as an uninter- 
rupted trial (without silence between the sequences). In 
order to distinguish between adaptation-based effects 
and those modulated by attention, the participants were 
instructed to concentrate on a soundless movie of their 
choice and ignore the ongoing auditory stimulation. We 
set up two consecutive MEG experiments to manipulate 
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either the frequency separation (Af) or the inter stimulus 
interval (ISI) of the succeeding adaptation and test 
sequences. In addition, behavioral measurements were per- 
formed, whilst replicating all sessions of the MEG study, in 
order to evaluate the participants' perceptual state. 

In the first experiment, Af between the A and B tones 
of the initial Adaptation and subsequent Test sequences 
was altered (small, intermediate or large Af) by varying 
the frequency of the B tones. The frequency of the A 
tones did not vary throughout the entire experiment. 
We anticipated that the presentation of a small Af dur- 
ing the Adaptation sequence would induce increased 
neuronal activity during a subsequent intermediate or 
large Af Test sequence because of the concomitant re- 
duction in neuronal adaptation as Af increases at the be- 
ginning of the Test sequence. We also expected to see 



the complimentary result, that large and intermediate Af 
values during the Adaptation sequence would result in 
decreased neuronal activity during the small Af Test se- 
quence. The ISI between A and B tones did not vary 
during this first trial. 

In the second experiment, a constant Af of 6 semitones 
was used and the ISI was varied. A short ISI in the 
Adaptation sequence was followed by a long ISI in the 
Test sequence and vice versa. In this experiment we spe- 
cifically investigated whether the undergoing adaptation 
effect is dependent upon the specific frequency of the A 
or B tone, or whether particular neurons might be tuned 
to a specific Af-shift, regardless of the actual frequency 
range [2,3]. The process of adaptation in response to 
simple frequencies in the auditory system is known to be 
dependent on the concrete repetition rate of the stimuli 
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Figure 1 Experimental design. This figure illustrates the two different ABA sequences (Adaptation and Test) presented in Experiments 1 and 2, 
plotted as relative Frequency against relative Time. These sequences were presented with randomly-organized changes in position in 
uninterrupted trials. (A) Experiment-1; The Inter Stimulus Interval (ISI) between A and B tones in both sequences was fixed at 125 ms. Three 
different degrees of Af (2, 4 and 1 0 semitones) were presented as either Adaptation or Test sequence. (B) Experiment-2; Af between successive 
A and B tones was fixed at 6 semitones but the ISI varied between short (100 ms) and long (225 ms). 
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[15,16]. Thus, if the expectations proposed above (for Ex- 
periment 1) rely on adaptation to the repeated presentation 
of the A and B tones, changing the rate of presentation be- 
tween the two sequences should disrupt the adaptation 
course and not induce modulation of the neuronal activity 
during the Test-sequence in that experiment. 

Methods 

Test participants 

Fifteen right-handed participants (5 males) aged between 
22 and 30 years participated in this study. None of them 
had a history of otological or neurological disorders. Nor- 
mal audiological status, defined by air conduction hearing 
thresholds of less than 10 dB HL in the frequency range 
between 250 and 4000 Hz was verified by pure tone audi- 
ometry. Participants gave written informed consent to 
take part in the study in accordance with procedures 
approved by the Ethics Commission of the University of 
Miinster and the Declaration of Helsinki. 

MEG experiments 

Two different MEG experiments were carried out using 
ABA tone-pip sequences (sinusoidal tone-pips of 25 ms dur- 
ation, including 10 ms rise and decay times). The loudness 
of the stimuli was set to 60 dB above the individual hearing 
thresholds. Trials were organized as a combination of the 
Adaptation sequence and subsequent Test sequence in an 
uninterrupted trial. In two subsequent experiments we 
investigated the effects of the preceding Adaptation se- 
quence on the following Test sequence by varying (a) fre- 
quency separation (Experiment 1) and (b) inter stimulus 
interval (Experiment 2) (c.f. Figure 1). The duration of each 
trial was 12 s: 6 s for the adaptation and 6 s for the following 
test sequence. In each experiment, 80 trials were presented: 
40 Adaptation sequences and 40 Test sequences. The order- 
ing of the Adaptation and Test sequences was randomly 
organized. The inter trial interval (ITI) was 3 s, thus the 
total recording time of one experimental session was 20 
minutes. 

Experiment 1 consisted of two 20-minute sessions 
during which three different Af values for the ABA tri- 
plets were presented: small (2 semitones), intermediate 
(4 semitones) and large (10 semitones). In each case the 
frequency of tone A was 500 Hz; therefore tone B was 
either 561 Hz for the small Af, 630 Hz for the inter- 
mediate Af or 891 Hz for the large Af, respectively. The 
ISI linking the ABA tones as triplets was constantly set 
to 100 ms; thus the ISI between successive A tones was 
225 ms and between successive B tones was 475 ms. 

In the first session a small Af Adaptation sequence 
was followed by a intermediate Af Test sequence 
and vice versa (c.f. Figure 1A), with the exact order- 
ing of these conditions randomized. The second 



session compared small Af Adaptation with large Af 
Test sequences, and vice versa, in the same way. 

Experiment 2 consisted of one 20-minute session in 
which the ISIs between A and B tones in consecutive 
sequences were changed, opposing long ISI vs. short ISI 
and vice versa (c.f. Figure IB). Af remained at 6 semitones 
(A = 500 Hz and B = 707 Hz) throughout Experiment 2, a 
value that has been previously determined in many studies 
to facilitate the perception of two streams [16,21,22,34,35]. 
In the long ISI condition, the period between the ABA trip- 
let tones was 225 ms; thus, the ISI of successive A tones 
and successive B tones remained at 475 ms and 975 ms, re- 
spectively. In the short ISI condition, the A and B tones 
were presented twice as fast, with an ISI of 100 ms between 
triplet tones. The ISIs between successive A and successive 
B tones were therefore 225 ms and 475 ms, respectively. In 
the same manner as in Experiment 1, a short ISI- 
Adaptation sequence was followed by a long ISI-Test se- 
quence and vice versa (long ISI followed by a short ISI), 
with the exact ordering of these conditions randomized. 

Behavioral experiment 

In the behavioral experiment we used the same experi- 
mental design as the MEG measurements (described 
above, also c.f. Figure 1) to investigate the participants' 
perceptual state. Three participants from the prior MEG- 
measurements could not participate in the behavioral 
tests. Three sessions were carried out, corresponding to 
both MEG-experiments. In the first session a small Af 
Adaptation sequence was followed by a intermediate Af 
Test sequence and vice versa, in randomized order (as in 
session one of Experiment 1). The second session com- 
pared small Af Adaptation with large Af Test sequences, 
and vice versa (as in session two of Experiment 1). In the 
third behavioral session a short ISI Adaptation sequence 
was followed by a long ISI Test sequence and vice versa 
(as in Experiment 2). All three 15 minute sessions took 
place on the same day two weeks after the MEG- 
measurements. Within each session 24 trials were 
presented, corresponding to 12 repetitions of each of the 
conditions (small versus intermediate Af; small versus 
large Af; short versus long duration ISI). In order to avoid 
the effect of "echoic memory" involving successive trials of 
active listening [34,42-44], the trial duration was shor- 
tened to 8 s (4 s of Adaptation and 4 s of Test) and the in- 
ter trial interval (ITI) was extended to 5 s. The 
participants were instructed to indicate the point at which 
their perception switched from one to two streams by 
pressing a mouse button during the presentation of each 
sequence. Prior to the experiment, the participants prac- 
ticed 4 sequences of each condition in order to restrict 
intentional biasing of the perception in favor of one of the 
streams (A or B tones) [3,5]. The responses to each 
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condition in each position as Adaptation or Test sequence 
were collected separately across all three sessions. 

MEG data acquisition 

The MEG recording was performed using a 275-channel 
whole-head system (Omega2005, VSM-Medtech, Port 
Coquitiam, BC, Canada), sampled at 600 Hz. The MEG 
measurements preceded the behavioral recordings in order 
to avoid possible effects of attention, intention and prior ex- 
perience. The participants had not received prior informa- 
tion about the paradigm, the stream segregation phenomena 
or the purpose of the study. They were comfortably seated 
in an upright position, instructed to ignore the sounds and 
watch a soundless video of their choice, presented on a 
monitor in front of them [16,22]. Eighty trials per ses- 
sion were recorded, yielding a sufficient number of 
MEG data. 

Data analysis of MEG data 

The MEG analysis was performed using the CTF 
software package. After applying a high-pass filter 
with cutoff-frequency of 1 Hz the MEG data were 
separated into epochs of 450 ms (from -50 to 
+400 ms relative to the stimulus onset of the ABA 
triplets). Epochs with magnetic field amplitudes 
(peak-to-peak) larger than 3 pT were excluded from 
further analysis as artifacts. After averaging, a 20 Hz 
low-pass filter was applied. Importantly, the 
responses to the first and last triplets in each condi- 
tion were excluded from the averaging procedure in 
order to avoid a stronger influence from the onset 
and offset of each sequence on the grand average 
data. 

Assuming the model of an equivalent current dipole 
(ECD) in a spherical volume conductor, a spatio- 
temporal dipole fit was performed. Tl-weighted mag- 
netic resonance imaging (MRI) of the head was obtained 
from all listeners on a 3 T Scanner (Gyroscan Intera 
T30, Philips, Amsterdam, Netherlands). The parameters 
(center location and radius) of the spherical head model 
were derived from the individual MRI. The interval used 
for the ECD fit (-30 ms) was placed around the local 
maximum of the PI component of the AEF. The PI di- 
polar sources evoked by tone B were less variable across 
conditions compared to the Nl sources and, thus, pro- 
vided a better signal-to-noise ratio. For each participant, 
two ECDs (one in each hemisphere) were determined, 
defined by their dipole moment, orientation and spatial 
coordinates (a goodness of fit larger than 85% was 
imposed). Median values of x, y, and z coordinates of 
the dipole location and the angles of the dipole orienta- 
tion were calculated for each condition. The individual 
averaged values for PI (across all conditions) were used 
to fix the dipole position and orientation and then the 



source space projection method [45] was applied to cal- 
culate the components of the transient evoked response 
(PI, Nl) for each condition. 

Statistical analysis of MEG data 

The major goal of this study was to examine the influence 
of the initial stimulation on the Test sequence condition. 
For that purpose, the responses of the same condition pre- 
sented in Adaptation and Test positions were compared 
(cross-checking comparison). 

Repeated measurements 2x2x2 ANOVA were per- 
formed for the peak amplitudes and latencies of PI 
and Nl components to the B tones of the ABA trip- 
let. The statistical analysis always included three fac- 
tors: "Hemisphere" (right (RH) and left (LH)), "Af " 
(Experiment 1) or "ISI" (Experiment 2) consisting of 
2 variables for each of the investigated Af values 
(Af =2 vs. Af = 4 or 10 semitones, or short ISI vs. 
long ISI) and "Part" (also containing two variables, 
showing that the stimulus was presented as an 
Adaptation or Test condition). For two participants, 
PI (one participant) and/or Nl components (one 
participant) could not be derived from the average 
response time course, and were not included in the 
group analysis. 

Four separate ANOVAs were performed for Experi- 
ment 1 and one further separate ANOVA for Experi- 
ment 2. For all statistical analyses Bonferroni correction 
was applied. 

In the first analyses the 2 and 4 semitone conditions 
from the first session of Experiment 1 were examined. 
This enabled us to test the effect of the initial 4 semi- 
tones condition on the subsequent 2 semitones Test and 
the effect of the initial 2 semitones on the 4 semitones 
Test by crosschecking the effects of the identical condi- 
tion presented in Adaptation and Test positions. An- 
other separate ANOVA was applied for the second 
session, to investigate the effect of 2 semitone Adapta- 
tions on the 10 semitone Test and vice versa. In the 
third analyses, the initial effect of the 2 semitone condi- 
tions on the 4 and 10 semitone Tests was examined. For 
that purpose, the 4 and 10 semitone conditions from the 
first and second sessions, presented in both Adaptation 
(for control) and Test positions were entered into an- 
other ANOVA. In a fourth ANOVA only the 2 semi- 
tones Test conditions from session one (following an 
initial 4 semitones) and session two (initial 10 semi- 
tones) of Experiment 1 were entered. In that way the ini- 
tial effects of 4 and 10 semitone Adaptation conditions 
on the neural activity of the 2 semitone Test were inves- 
tigated. In this analysis "A-f was labeled "session" (2 
semitone condition from session 1 and from session 2). 

A separate ANOVA-analysis was also used for Experi- 
ment 2, in the same way as for the first experiment. 
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Figure 2 Grand averaged source waveforms across all participants from Left and Right Hemispheres. (A) The two different degrees of 
frequency separation between A and B tones (Af = 2 semitones [black lines] and Af=4 semitones [grey lines]) which were presented in the first 
experimental session are shown. The responses recorded from their presentation as Adaptation (on top) or Test (below) sequences are shown. (B) 
The second experimental session presented stimuli of Af=2 (black lines) and Af= 10 semitones (grey lines), presented as Adaptation (top) and 
Test (below) sequences. 



Results 

Source waveform data 

Experiment 1. Frequency-based neuronal activity and 
lateralization effects 

The grand averaged responses from the first session of 
Experiment 1, are shown in Figure 2 A (small Af [2 



semitones] are indicated by black lines; intermediate Af 
[4 semitones] are in gray). The upper and the lower 
graphs display the responses to the Adaptation and Test 
sequences, respectively. The amplitude of the PI compo- 
nent was generally found to have increased for Af = 4 
semitones compared to Af = 2 semitones (significant 
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main effect Af [F(l,14) = 23.468, pK.OOl]), whereas the 
amplitude of the Nl component was only slightly stron- 
ger in Af = 4 semitones compared to Af = 2 semitones. 
The PI amplitudes were also slightly larger in the left 
compared to the right hemisphere. In contrast, the Nl 
amplitude was significantly enhanced in both conditions 
(4 and 2 semitones) in the right hemisphere compare to 
left hemisphere only on a Test position (interaction 
Hemisphere x Part for Nl amplitude [F(l,14) = 13.993, 
p < .05]). 

Figure 2B shows the grand average responses from the 
second session of Experiment 1, with the PI component 
significantly stronger in condition Af = 10 semitones 
compared to the Af = 2 semitones (significant main effect 
Af [F(l,13) = 12.598, p < .001]). This enhanced amplitude 
was more prominent when presented as a Test sequence 
(main effect "Part" for PI amplitude [F(l,13) = 5.610, p 
<.05] ). The Nl amplitude was also significantly 
enhanced when the 10 semitones condition was pre- 
sented as a Test sequence and it was more dominant in 
the RH (significant interaction Af x Part for Nl ampli- 
tude [F(l,13) = 8.384, p<.05] and main effect of Hemi- 
sphere [F(l,13) = 8.542, p < .05]). In addition, the Nl was 
significantly shortened in the RH for both 2 and 10 
semitones (significant main effect Hemisphere for Nl la- 
tency, [F(l,13) = 6.093, p < .05)]). 

The responses of PI and Nl for 4 semitones and 10 
semitones conditions from the first and second sessions 
of Experiment 1 (in adaptation and test position) were 
entered in another separate 2x2x2 model ANOVA, in 
order to compare the effects induced by the preceding 
small Af sequence (2 semitones). The statistical analysis 
showed that PI was significantly increased in the LH for 
both Af = 4 and Af = 10 semitones conditions presented 
as Test sequence, compare to adaptation position (sig- 
nificant interaction for Hemisphere x Part [F 
(1,13) = 11.747, p<. 05]). In contrast, the Nl amplitude 
was increased in the RH for these same conditions (sig- 
nificant main effect Hemisphere [F(l,13) = 6.759, p < .05]). 
This lateralization effect was more prominent for Af = 10 
semitones during the Test sequence (significant inter- 
action Hemisphere x Af [F(l,13) = 8.851, p < .05], and Af x 
Part [F(l,13) = 8.044, p < .05]). For the 4 and 10 semitones 
Af stimuli presented only on adaptation position (no initial 
stimulation), significant effects were obtained regarding 
the lateralization of the amplitude of Nl (main effect 
Hemisphere [F(l, 14) = 7.318, p<.05] and interaction 
Hemisphere x Af [F(l,14) = 6.676, p<.05]), the Nl was 
significantly larger in the LH for 4f=4 semitones, 
whereas for Af = 10 semitones the amplitude was stron- 
ger in the RH. 

A further separate ANOVA was applied to data from 
the first and second sessions in order to test the effects 
of an initial intermediate (Af = 4) and large (Af = 10) 



frequency separation on the small Af Test condition. 
Only the amplitude of PI was significantly enhanced 
when the 2 semitone stimulus was presented in the Test 
position condition compared to Adaptation position 
(significant main effect Part [F(l,13) = 6,331, p < .05]). 

Figure 3 summarizes the fluctuations in neuronal activ- 
ity and the lateralization effects for all Af values (2, 4 and 
10 semitones), presented as Test or Adaptation, separately 
for the PI and Nl components. Error bars indicate the 
95% confidence intervals for the within-subject effect (Part 
x Semitones), by Loftus & Masson [46]. 

Experiment 2. influence of the alternating ISI 

The short ISI condition was analyzed in the time- 
interval of -50 ms to 400 ms and the long ISI condition 
was analyzed in the time-interval of -50 ms to 800 ms. 
The results of these analyses are displayed in Figure 4A 
with Adaptation and Test conditions denoted by black 
and grey lines, respectively. Concerning the B-tone, sig- 
nificant main effects and interaction regarding the pos- 
ition in the trial and the lateralization were neither 
obtained for PI amplitude (main effect Hemisphere 
[F(l,13) = 1.397, p =0 .258], ISI [F(l,13) = 2.036, p =0 
.177], Part [F (1,13) = 3.201, p =0 .097] and Hemisphere 
x Part [F(l,13) = 2.431, p =0 .143]) nor for Nl amplitude 
(main effect Hemisphere [F( 1,1 3) = 2.024, p =0 .178], ISI 
[F(l,13) = 0.604, p =0 .451], Part [F( 1,1 3) = 0.640, p =0 
.438] and Hemisphere x Part" [F(l, 13) = 3.755, p =0 
.075]). 

Figure 4B shows the neuronal activity and the 
lateralization effects for short ISI and long ISI, separately 
for PI and Nl components, presented as Test or as 
Adaptation. Error bars represent 95% confidence inter- 
vals for the within subject effect Part x Hemisphere [46]. 

Behavioral data 

The results of the behavioral experiments are shown in 
Figure 5. The data obtained in response to small Af (2 
semitones) from the first two sessions were analyzed 
separately from the intermediate and large Af conditions 
(4 and 10 semitones). The statistical analysis revealed 
significantly better performance of streaming when Af of 
4 and 10 semitones stimuli were presented in the Test 
position compared to the same stimuli presented in the 
Adaptation position (significant main effect Part [F 
(1,11) = 14.286, p< .05]). In the third session no partici- 
pants reported the perception of two separate streams in 
the long ISI condition, regardless of its presentation as 
Test or Adaptation sequence. In contrast, all participants 
perceived two streams in the short ISI condition but 
there was no significant difference related to the position 
as Test or Adaptation sequence [£(11) = -1,149, p = 0. 
275; independent sample t test]. 
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Discussion 

Neurophysiologic correlates of stream segregation 
Effect of alternating the frequency separation (Experiment 1) 

To our knowledge, this study reports for the first time 
the differing behavior of PI and Nl components of 
human AEFs when the frequency separation between A 
and B tones during initial Adaptation and subsequent 
Test sequences of a streaming task have alternated. 
Neural activity generally increased as the frequency sep- 
aration between A and B tones increased, a finding 
which is in line with previous studies [16,21,22,35]. We 
found that in all conditions (Af of 2, 4 and 10 semitones) 
the PI enhancement was more prominent during the 
Test sequence but that the Nl amplitude was signifi- 
cantly increased only in the 10 semitone Test-condition. 



Neuronal activity underlying streaming is known to be 
modified by prior adaptation [28,35], but the reported 
modulations of PI and Nl components are not as spe- 
cific, as seen here. Snyder and colleagues, for instance 
[35] performed a similar EEG study with a two- 
consecutive-conditions design but with 1.44 s of silent 
interval between conditions. The Af of the Adaptation 
sequence in their study was 3, 6, or 12 semitones, fol- 
lowed by a Test sequence with a Af of 6 semitones. Un- 
like our design, the participants were required to pay 
attention to the stimuli. They reported that a small Af in 
the Adaptation sequence induces PI enhancement in re- 
sponse to the first A tone of the ABA triplet in the Test 
sequence. In the same manner, in our study the PI com- 
ponent was enhanced in all conditions on a Test 
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position, therefore the PI clearly shows the context ef- 
fect. We also found a significant enhancement of the Nl 
component in the case of the 10 semitone Test condi- 
tion following the initial 2 semitones Adaptation se- 
quence. This further modulation of the Nl could have 
been induced by the specific paradigm we used (two 
Adaptation and Test-sequences with no gap in between), 



because the contextual effect between consecutive tone 
patterns has been shown to depend upon the period of 
silence between them [2,31,40,41] and to decrease as a 
function of time [35]. Furthermore, the enhancement of 
neural activity which we observed during the intermedi- 
ate and large Af Test sequences following adaptation to 
a small Af reliably concurred with our predictions the 
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implications of previous research [35]. However, the 
amplitude of the PI component was also found to in- 
crease in the small Af-Test sequence following initial lar- 
ger Af stimulation, despite our expectation that neuronal 
activity would reduce. The PI component therefore 
appeared to be enhanced in all Test conditions, regard- 
less of the size of the initial Af Adaptation. This finding 
is contrary to previously reported psychophysical results 
[23] that demonstrated a reduced probability of two 
streams being perceived after adaptation to a larger Af 
pattern. On the other hand, Snyder and colleagues 
[23,34] proposed that multiple levels of representation 
could underlie the perceptual organization of sequential 
auditory patterns, and the facilitative perceptual effect 
and the stimulus driven-effect are proposed to be at least 
partially independent [34]. It has also been suggested 
that distinct cortical areas may be active in the encoding 
of the adaptation and test streaming patterns [35]. Given 
this suggestion, it is likely that the PI and Nl response 
components represent different modulation effects in 



the encoding of subsequent streaming patterns: PI 
demonstrates the adaptation effect (context) and Nl 
demonstrates the streaming effect. Further support for 
this assumption is in the dissimilar task sensitivity of 
both components [47], and this is in turn supported by 
evidence that PI and Nl are generated by different audi- 
tory cortical regions [48-51]. Moreover, distinct neuronal 
mechanisms are responsible for automatic auditory 
stream segregation and attention-dependent buildup 
processes [8,22,28,38], and attention operates in a more 
pronounced manner on Nl and later components (P2 
and N2) [36,51-53]. For this reason, additional modula- 
tion in the auditory cortical network could be expected 
in an active listening task. 

Lateralization effects, based on frequency alternations 

Our results from the first experiment showed lateralization- 
differences of the B tone-related PI and Nl components 
between different frequency separation conditions (2, 4 and 
10 semitones). The amplitude of the PI component was sig- 
nificantly increased for all conditions (Af of 2, 4 and 10 
semitones) in the LH during the Test sequence. In contrast, 
the Nl amplitude was significantly enhanced in the RH for 
higher frequency separations only (Af of 4 and 10 semi- 
tones). The Nl latency was also shortened in the case of the 
10 semitones stimulus in the Test, which is in line with the 
findings of Roberts & Poeppel who demonstrated that the 
Nl peak latency varies as a function of the stimulus fre- 
quency [54]. It has been also proposed that amplitude and 
latency lateralization of the Nl component has a common 
origin [55]; a proposal which can also be observed in our 
data (RH amplitude and latency lateralization of Nl). 
Regarding streaming, Snyder and colleagues demonstrated 
that the modulation of Nl occurred at the right but not at 
the left temporal electrode, suggesting an RH dominance 
for frequency separation-based segregation [22], in line with 
our findings. 

In summary, the differences in the modulation of the 
AEFs that were demonstrated in the first experiment 
suggest that PI and Nl represent at least partially- 
different mechanisms both of which are active during 
the automatic segregation of subsequent streaming pat- 
terns. The two components also showed different 
lateralization behavior: PI was commonly enhanced in 
the LH in all conditions, whereas Nl was increased sig- 
nificantly only in the RH during the 10 semitone Test 
sequence. 

Effect of alternating the ISI (Experiment 2) 

The goal of the second experiment was to test the effect 
of alternating the ISI between A and B tones in consecu- 
tive Adaptation and Test conditions in an uninterrupted 
trial. The frequencies of A and B tones were kept con- 
stant during the whole trial; therefore the tonotopic 
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representation was identical. The morphology of the 
AEF responses to the short ISI condition in both the 
Test and Adaptation positions was almost identical and 
not significantly different in comparison to long ISI 
responses. Thus, the presentation of an initial long ISI 
condition did not influence the segregation process dur- 
ing the following short ISI sequence and vice versa. 
Since adaptation in the auditory cortex is dependent not 
only on the specific frequency but also on the concrete 
repetition rate of the stimuli [13,56,57], it could be sug- 
gested that the different ISIs which were used in the 
consecutive Adaptation and Test sequences caused dis- 
ruption of the frequency-specific adaptation to A and B 
tones. Both sequences were therefore processed as dis- 
tinct temporal events, without the contextual influence 
of the neural activity of one upon the other, regardless of 
having the same tonotopic representation. Based on that 
finding, it could be suggested that the modulation effects 
reported in the first experiment reflect the repeated 
presentation of specific tone frequencies but not of neu- 
rons tuned to a certain Af-shift [3]. This idea has also 
been tested in a recent psychophysical study by using 
Adaptation sequence A tones of greatly different fre- 
quency to those of the Test sequence, and B tones of ei- 
ther 3 or 12 semitones above the A tone frequency [23]. 
The presentation of a small or large Af initial sequence 
did not affect the perception of streaming during the 
test, supporting the idea that the context effect reflects 
adaptation to the repeated presentation of specific tone 
frequencies [23]. It should be mentioned, however, that 
the ISI values used in the present experiment were ex- 
treme (one very long ISI and one half the length) and 
that these conclusions regarding the concrete ISIs and 
modulation of the neural activity are therefore not ne- 
cessarily applicable to other ISI conditions. 

Behavioral study 

The behavioral study was conducted in order to evaluate 
the participants' perception using the same paradigm as 
in the MEG measurements. All aspects of the previous 
MEG experiments were replicated as well as the behav- 
ioral measurements. The results revealed that the likeli- 
hood of reporting two streams increased as Af increased, 
a finding which is in line with previous studies 
[16,21,22,35,39]. The main purpose was to evaluate the 
context effects of initial adaptation on the streaming 
perception during the Test sequence, in order to com- 
pare them with the neurophysiological findings. The 
results demonstrated a greater probability perceptual 
streaming in response to large and intermediate Af in 
the Test sequence position following a small Af Adapta- 
tion sequence compared with a large or intermediate Af 
on Adaptation position. No significant effects or interac- 
tions were found regarding the perception of small Af 



Test sequence when preceded by large (10 semitones) 
and intermediate (4 semitones) Af Adaptation presenta- 
tions despite the fact that the PI component appeared 
to be enhanced in the MEG outcome of our study. The 
lack of perceptual modulation during the 2 semitone 
Test sequence could have been caused by using a small 
Af because the perceptual context effect was reported to 
be strongest at intermediate Af values [23,34,35]. Fur- 
thermore, it has been proposed that streaming build-up 
occurs at levels of neuronal representation with sharp 
frequency tuning, unlike the effect of prior context 
which is caused by the adaptation of neurons with wide 
frequency tuning [23]. Thus, modulations in neural ac- 
tivity were reported in our neurophysiological experi- 
ments in all Test conditions (enhanced PI component), 
unlike the perceptual effect which was less responsive to 
the initial large Af Adaptation conditions. 

The behavioral data obtained when altering the ISI, 
showed no significant differences in the short ISI Test 
condition when preceded by the long ISI Adaptation se- 
quence and vice-versa. We therefore conclude that adap- 
tation sequences of different ISIs do not affect the 
subsequent perception of streaming. This psychophysical 
finding compliments the results of the corresponding 
MEG experiment which showed no AEF modulation fol- 
lowing variation of ISI in the Adaptation sequence. As 
proposed above, the neurons undergoing the adaptation 
effect are sensitive not only to a specific frequency 
range, but also to the concrete repetition rate of the pre- 
senting stimuli [13,56,57]. Our findings lend further sup- 
port to the idea that the initial adaptation that facilitates 
the streaming process is rather a peripheral mechanism 
that depends on simple tone frequencies but not higher 
order auditory features [23]. 

In addition, it should be mentioned that the perceptual 
state cannot be directly ascribed to the electrophysio- 
logical data seen in prior pre-attentive MEG experi- 
ments, since the buildup of streaming requires several 
seconds [2,31]. Moreover, different levels of representa- 
tion underlying the contextual influence in auditory 
stream segregation have been suggested by Snyder and 
colleagues [23] as pointed out above, and the effect of 
the initial context that facilitates the streaming percep- 
tion appeared to be different from the neural effect 
caused by the prior small or large Af-based adaptation 
[28,35]. 

Conclusions 

Our MEG experiment demonstrates dissimilar behavior 
in PI and Nl response components during the auto- 
matic segregation of sound, induced by an initial Adap- 
tation sequence. The PI component appeared to be 
enhanced in all Test conditions and thus demonstrates 
the preceding context effect, whereas Nl was specifically 
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modulated only in the large frequency separation Test 
condition induced by a preceding small Af Adaptation 
sequence. This finding combined with the difference we 
observed in the lateralization of PI and Nl could sug- 
gest that both components represent at least partially- 
different systems underlying the perceptual organization 
of streaming patterns. The psychophysical results of the 
parallel behavioral study show that prior adaptation to a 
smaller degree of frequency separation facilitates the 
streaming perception. However, neither the neural activ- 
ity nor the perception of successive streaming sequences 
were modulated when the inter stimulus intervals were 
varied, thus lending support to the idea that the initial 
modulation effect reflects adaptation to the repeated 
presentation of specific tone-frequencies. 
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